CLAIMS:

1. A method for classifying a text object, comprising:

extracting a set of features from the text object;

constructing a document class fuzzy set with ones of the set of features
[...];

measuring a degree of match between each of a plurality of class fuzzy sets
and the document class fuzzy set; and

using the measured degree of match to assign the text object a label that
satisfies a selected decision making rule.

2. The method according to claim 1, further comprising learning each
class fuzzy set in the knowledge base.

3. The method according to claim 2, wherein each class fuzzy set is
learned by:

obtaining a set of class training documents;

merging those training documents in the set of training documents with
similar labels to create a class document; and

computing a class fuzzy set using the class document.

4. The method according to claim 1, wherein the set of features is
extracted from the text object by:

tokenizing the document to generate a word list;

parsing the word list to generate the set of grammar based features;

filtering the set of grammar based features [...] fuzzy set.

5. The method according to claim 1, wherein the document fuzzy set is
computed by: 

calculating a frequency of occurrence for each feature in the set of 
features in the text object; 

normalizing the frequency of occurrence of each feature in the set of 
features; and 

transforming the normalized frequency of occurrence of each feature in 
the set of features to define the document fuzzy set. 

6. The method according to claim 5, wherein the normalized frequency
of occurrence of each feature in the set of features is transformed using a 
bijective transformation. 

7. The method according to claim 1, wherein the degree of match 
between each of the plurality of class fuzzy sets and the document fuzzy set is 
measured using one of a maximum-minimum strategy and a probabilistic 
reasoning strategy based upon semantic unification. 

8. The method according to claim 1, further comprising:

filtering each degree of match with an associated class specific filter
function to define an activation value for its associated class rule;

identifying the activation value of the class rule with the highest
activation value; each class rule having an associated class label; and

assigning the class label of the class rule with the highest identified
activation value to classify the text object into one of the plurality of class fuzzy
sets.

9. The method according to claim 8, further comprising learning each
associated class specific filter function.

10. The method according to claim 1, wherein the decision making rule 
is used to identify one of a maximum value, a threshold value, and a 
predefined number. 
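
As an illustration of claims 5 through 10, the following Python sketch computes a document fuzzy set from feature frequencies, measures a maximum-minimum degree of match against class fuzzy sets, and applies a maximum-value decision making rule. It is a minimal sketch under stated assumptions, not the claimed implementation: the function names (`document_fuzzy_set`, `max_min_match`, `classify`) and the example knowledge base are hypothetical, and the bijective transformation is assumed to be the common probability-to-possibility form mu(f) = sum over g of min(p(f), p(g)), which the claims themselves do not spell out.

```python
from collections import Counter


def document_fuzzy_set(features):
    """Map each feature to a membership value in the document fuzzy set."""
    counts = Counter(features)                          # frequency of occurrence
    total = sum(counts.values())
    probs = {f: c / total for f, c in counts.items()}   # normalized frequency
    # Assumed bijective transformation: mu(f) = sum_g min(p(f), p(g)).
    return {
        f: sum(min(p, q) for q in probs.values())
        for f, p in probs.items()
    }


def max_min_match(class_set, doc_set):
    """Maximum over shared features of the minimum of the two memberships."""
    shared = class_set.keys() & doc_set.keys()
    return max((min(class_set[f], doc_set[f]) for f in shared), default=0.0)


def classify(doc_features, knowledge_base):
    """Return the best-matching label and the degree of match per class."""
    doc_set = document_fuzzy_set(doc_features)
    scores = {
        label: max_min_match(class_set, doc_set)
        for label, class_set in knowledge_base.items()
    }
    # Decision making rule assumed here: pick the maximum degree of match.
    return max(scores, key=scores.get), scores


# Hypothetical knowledge base: class label -> class fuzzy set.
knowledge_base = {
    "sports": {"a": 1.0, "b": 0.4},
    "finance": {"c": 0.6, "d": 1.0},
}
label, scores = classify(["a", "a", "b", "c"], knowledge_base)
print(label, scores)
```

A threshold or top-k rule, the other options named in claim 10, could replace the final `max` by filtering or sorting `scores` instead.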

11. A method for classifying a text object, comprising:

extracting a set of granule features from the text object;

constructing a document granule feature fuzzy set using ones of the
granule features extracted from the text object; each of the ones of the
granule features extracted from the text object having a degree of
membership in a corresponding granule feature fuzzy set of the document
granule feature fuzzy set and a plurality of class granule feature fuzzy sets of
a knowledge base;

computing a degree of match between each of the plurality of class
granule feature fuzzy sets and the document granule feature fuzzy set to
provide a degree of match for each of the ones of the granule features;

aggregating each degree of match of the ones of the granule features
to define an overall degree of match for each feature; and

using the overall degree of match for each feature to assign the text
object a class label that satisfies a selected decision making rule.

12. The method according to claim 11, further comprising filtering the
granule features extracted from the text object to define the ones of the 
granule features used to construct the document granule feature fuzzy set. 

13. The method according to claim 12, wherein the filtering of the
granule features is based upon one of Zipf's law and semantic discrimination
analysis.

14. The method according to claim 11, wherein each granule feature is
represented by a plurality of fuzzy sets and associated labels.

15. The method according to claim 11, wherein the ones of the granule
features that are used to construct the document granule feature fuzzy set are
reduced to one of a predefined threshold number of granule features and 
range of granule features. 

16. The method according to claim 11, further comprising learning each
granule fuzzy set in the knowledge base.

17. The method according to claim 11, wherein the degree of match
between each of the plurality of class granule feature fuzzy sets and the
document granule feature fuzzy set is measured using one of a
maximum-minimum strategy and a probabilistic reasoning strategy based upon semantic
unification. 
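
The aggregation step of claim 11, with the product and additive models named in claims 20 and 21, might be sketched as follows. This is only an illustrative sketch: the function names, the granule feature names ("words", "bigrams"), and the weight values are hypothetical, and the additive model is assumed to be a weighted mean with positive weights, which is one of several possible readings of "weighted function".

```python
import math


def aggregate_product(matches):
    """Product model: multiply the per-granule-feature degrees of match."""
    return math.prod(matches.values())


def aggregate_additive(matches, weights):
    """Additive model (assumed form): weighted mean of the degrees of match.

    Weights are assumed positive; they are normalized to sum to one.
    """
    total = sum(weights[f] for f in matches)
    return sum(weights[f] * m for f, m in matches.items()) / total


# Hypothetical degrees of match of one class, one per granule feature.
matches = {"words": 0.8, "bigrams": 0.5}
weights = {"words": 3.0, "bigrams": 1.0}   # hypothetical estimated weights

print(aggregate_product(matches))
print(aggregate_additive(matches, weights))
```

The product model penalizes a class heavily when any single granule feature matches poorly, while the additive model lets a strong feature compensate for a weak one in proportion to its weight.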

18. The method according to claim 11, further comprising:

filtering each degree of match with an associated class specific filter
function to define an activation value for its associated class rule;

identifying the activation value of the class rule with the highest
activation value; each class rule having an associated class label; and

assigning the class label of the class rule with the highest identified
activation value to classify the text object into one of the plurality of class
granule feature fuzzy sets.

19. The method according to claim 18, further comprising learning each
associated class specific filter function.

20. The method according to claim 11, wherein each degree of match
is aggregated using one of a product and an additive model.

21. The method according to claim 11, further comprising estimating
granule feature weights when they are aggregated as a weighted function.

22. A text categorizer for classifying a text object, comprising:

a knowledge base for storing categories represented by fuzzy sets and
associated class labels;

a pre-processing module for representing extracted features from the
text object as a document fuzzy set; and

an approximate reasoning module for using a measured degree of
match between the fuzzy sets in the knowledge base and the document fuzzy
set to assign the associated class labels of those categories that satisfy a
selected decision making rule.

23. The text categorizer of claim 22, wherein the fuzzy sets are granule
fuzzy sets.

24. The text categorizer of claim 22, wherein the fuzzy sets are class 
fuzzy sets.

25. The text categorizer of claim 24, further comprising a learning
module for learning the class fuzzy sets. 

26. The text categorizer of claim 25, further comprising:

a training database for creating a plurality of class documents; and

a validation database for validating learned class fuzzy sets in the
knowledge base. 
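
The learning recited in claim 3 and performed by the learning module of claim 25 could look roughly like the sketch below: training documents sharing a label are merged into one class document, and a class fuzzy set is computed from it. The names `learn_class_fuzzy_sets` and `max_normalized` are hypothetical, and `max_normalized` (frequency divided by the largest frequency) is only a stand-in for whatever fuzzy set construction is actually used, which the claims leave open.

```python
from collections import Counter, defaultdict


def max_normalized(features):
    """Stand-in fuzzy set builder: frequency divided by the largest frequency."""
    counts = Counter(features)
    top = max(counts.values(), default=1)
    return {f: c / top for f, c in counts.items()}


def learn_class_fuzzy_sets(training_docs, to_fuzzy_set=max_normalized):
    """Learn one class fuzzy set per label.

    training_docs is an iterable of (label, feature_list) pairs.
    """
    class_documents = defaultdict(list)
    for label, features in training_docs:
        class_documents[label].extend(features)   # merge documents sharing a label
    return {
        label: to_fuzzy_set(features)
        for label, features in class_documents.items()
    }


# Hypothetical training database.
training_docs = [
    ("sports", ["goal", "team"]),
    ("finance", ["bank"]),
    ("sports", ["goal"]),
]
print(learn_class_fuzzy_sets(training_docs))
```

A validation database, as in claim 26, would then be classified with the learned sets to check their accuracy before they are stored in the knowledge base.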